Learning Robotic Manipulation Tasks via Task Progress Based Gaussian Reward and Loss Adjusted Exploration
نویسندگان
چکیده
Multi-step manipulation tasks in unstructured environments are extremely challenging for a robot to learn. Such interlace high-level reasoning that consists of the expected states can be attained achieve an overall task and low-level decides what actions will yield these states. We propose model-free deep reinforcement learning method learn multi-step tasks. introduce Robotic Manipulation Network (RoManNet) 1 , which is vision-based model architecture, action-value functions predict action candidates. define Task Progress based Gaussian (TPG) reward function computes on lead successful motion primitives progress towards goal. To balance ratio exploration/exploitation, we Loss Adjusted Exploration (LAE) policy determines from candidates according Boltzmann distribution loss estimates. demonstrate effectiveness our approach by training RoManNet several robotic both simulation real-world. Experimental results show outperforms existing methods achieves state-of-the-art performance terms success rate efficiency. The ablation studies TPG LAE especially beneficial like multiple block stacking.
منابع مشابه
Vision Based Robotic Interception in Industrial Manipulation Tasks
In this paper, a solution is presented for a robotic manipulation problem in industrial settings. The problem is sensing objects on a conveyor belt, identifying the target, planning and tracking an interception trajectory between end effector and the target. Such a problem could be formulated as combining object recognition, tracking and interception. For this purpose, we integrated a vision sy...
متن کاملthe relationship among efl learners’ autonomy, first language essay writing tasks and second language essay writing tasks in task/content based language instruction
the ability of composing a coherent and extended piece of writing in second language is considered as a fundamental factor to convey information and ideas of learners through the academic issues. although learners may achieve a perfect academic writing skill through assigning the l2 tasks in content based instruction, but demonstration of their abilities may be related to their ability in l1 es...
15 صفحه اولPrediction learning in robotic manipulation
This thesis addresses an important problem in robotic manipulation, which is the ability to predict how objects behave under manipulative actions. This ability is useful for planning of object manipulations. Physics simulators can be used to do this, but they model many kinds of object interactions poorly, and unless there is a precise description of an object’s properties their predictions may...
متن کاملTask Planning for Robotic Manipulation in Space
Authors' current addresses: A.C. Sanderson, Department of Electrical, Computer and Systems Engineering, Rensselaer Polytechnic Institute, Troy, NY 12180; M.A. Peshkin, Department of Mechanical Engineering, The Technological Institute, Northwestern University, 2145 Sheridan Rd., Evanston, IL 60201; L.S. Homem de Mello, Robotics Institute, Department of Electrical and Computer Engineering, Camegi...
متن کاملPreschoolers evaluate risk and reward in exploration-exploitation tasks
Children are drivers of their own discovery. To develop a complete characterization of the factors that drive exploration in early childhood, we must first understand how competing factors influence children’s decision making. We investigated preschool-aged children’s decision-making on explore-exploit tasks where the available information about the distribution of rewards was controlled. When ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2022
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2021.3129833